Comparison of machine learning techniques for handling multicollinearity in big data analytics and high - performance data mining Gerard

نویسنده

  • Ghalib Bello
چکیده

§ The insights gained from this study could be useful in selecting machine-learning methods for automated pre-processing of thousands of correlated variables in biomedical data mining. Conclusions Comparison of machine learning techniques for handling multicollinearity in big data analytics and high-performance data mining Gerard G. Dumancas1* and Ghalib Bello2 *1Oklahoma Baptist University, Shawnee, OK, USA 74804, email: [email protected] 2Virginia Commonwealth University, Richmond, VA, USA 23284, email: [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

P-V-L Deep: A Big Data Analytics Solution for Now-casting in Monetary Policy

The development of new technologies has confronted the entire domain of science and industry with issues of big data's scalability as well as its integration with the purpose of forecasting analytics in its life cycle. In predictive analytics, the forecast of near-future and recent past - or in other words, the now-casting - is the continuous study of real-time events and constantly updated whe...

متن کامل

ارائه مدلی برای استخراج اطلاعات از مستندات متنی، مبتنی بر متن‌کاوی در حوزه یادگیری الکترونیکی

As computer networks become the backbones of science and economy, enormous quantities documents become available. So, for extracting useful information from textual data, text mining techniques have been used. Text Mining has become an important research area that discoveries unknown information, facts or new hypotheses by automatically extracting information from different written documents. T...

متن کامل

Handling Big Data Stream Analytics using SAMOA Framework - A Practical Experience

Data analytics and machine learning has always been of great importance in almost every field especially in business decision making and strategy building, in healthcare domain, in text mining and pattern identification on the web, in meteorological department, etc. The daily exponential growth of data today has shifted the normal data analytics to new paradigm of Big Data Analytics and Big Dat...

متن کامل

A Fuzzy TOPSIS Approach for Big Data Analytics Platform Selection

Big data sizes are constantly increasing. Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large data samples reveals and leverages business change. The popularity of big data analytics platforms, which are often available as open-source, has not remained unnoticed by big companies. Google uses MapReduce for PageRank and inverted indexes....

متن کامل

A Review on Big Data Analytics : An Eminent Approach for Handling an Outsized Data

The volatile increase of data volume and the growing demands of data mining have stimulated us into the era of big data. Many research scholars are drawn their desirability towards the research areas of big data mining, machine learning, computational intelligence and social networking. The big data technologies with conventional data mining approaches have posed many challenges in the field of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015